Topic Models for Audio Mixture Analysis

نویسندگان

  • Paris Smaragdis
  • Madhusudana Shashanka
  • Bhiksha Raj
چکیده

In contrast to the time-domain waveform, time-frequency representations explicitly represent the time-varying frequency content of a sound and effectively visualize the signal’s activity at any timefrequency bin. These transforms are often complex-valued and include well known tools such as the short-time Fourier transform, constant-Q transforms, wavelets, etc. However, because our hearing system is more sensitive to the relative energy between different frequencies, for most practical applications we study the modulus of these transforms and discard the phase which is useful only in special cases. These kinds of representations are essentially counting the number of time-frequency acoustic quanta that collectively make up complex sound scenes, similar to how we count words that make up documents. With this representation we can use the analogy of bag of frequencies, which we describe later in this document.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A topic classification system based on parametric trajectory mixture models

In this paper we address the problem of topic classification of speech data. Our concern in this paper is the situation in which there is no speech or phoneme recognizer available for the domain of the speech data. In this situation the only inputs for training the system are audio speech files labeled according to the topics of interest. The process that we follow in developing the topic class...

متن کامل

Latent topic model for audio retrieval

Latent topic model such as Latent Dirichlet Allocation (LDA) has been designed for text processing and has also demonstrated success in the task of audio related processing. The main idea behind LDA assumes that the words of each document arise from a mixture of topics, each of which is a multinomial distribution over the vocabulary. When applying the original LDA to process continuous data, th...

متن کامل

Modeling Dynamic Patterns for Emotional Content in Music

Emotional content is a major component in music. It has long been a research topic of interest to discover the acoustic patterns in the music that carry that emotional information, and enable performers to communicate emotional messages to listeners. Previous works looked in the audio signal for local cues, most of which assume monophonic music, and their statistics over time. Here, we used gen...

متن کامل

jLDADMM: A Java package for the LDA and DMM topic models

The Java package jLDADMM is released to provide alternatives for topic modeling on normal or short texts. It provides implementations of the Latent Dirichlet Allocation topic model and the one-topic-per-document Dirichlet Multinomial Mixture model (i.e. mixture of unigrams), using collapsed Gibbs sampling. In addition, jLDADMM supplies a document clustering evaluation to compare topic models.

متن کامل

Modeling long distance dependence in language: topic mixtures vs. dynamic cache models

In this paper, we investigate a new statistical language model which captures topic-related dependenciesof words within and across sentences. First, we develop a sentence-level mixture language model that takes advantage of the topic constraints in a sentence or article. Second, we introduce topic-dependent dynamic cache adaptation techniques in the framework of the mixture model. Experiments w...

متن کامل

Experiments on speech tracking in audio documents using Gaussian mixture modeling

This paper deals with the tracking of speech segments in audio documents. We use a cepstral-based acoustic analysis and gaussian mixture models for the representation of the training data. Three ways of scoring an audio document based on a frame-level likelihood calculation are proposed and compared. Our experiments are done on a database composed of television programs including news reports, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009